Semantic Web Tools for Categorization Greek Texts on the Internet: the MeDa13 standard and TeGO ontology
نویسنده
چکیده
The wider question of this study is the suitability of existing Web search engines for the needs of school education. It examines the relevance to the teaching objectives of the results returned by the search process given a query and its (stated or unstated) purpose in the context of an educational activity. The particular field of teaching and research interest is Modern Greek in Cypriot secondary education (third high school grade, G9). For the purpose of the research, the exploratory-descriptive and experimental approaches were adopted. The goal of this work is to automate the categorization of Greek texts available in the Internet into textual genres, according to their external (structural/format) and internal (linguistic, stylistic) features, using Semantic Web technology, i.e. metadata and ontologies. The long term vision is to build a semantic search engine that categorizes its results into textual genres as understood and used in high school teaching. The results so far highlighted the need for multi-categorization of texts, mainly due to their hybrid and multimodal nature. The contribution of this paper lies in the construction of a semantic search engine suitable for the needs of school education which returns results classified into textual genres, in accordance to their external and internal characteristics. An algorithm to categorize texts into genres was designed and tested on the basis of the standard metadata Dublin Core (DC). By adapting DC model to the needs of the present work, a model called ‘MeDa13’ (MetaData13) that includes thirteen elements, was designed. In addition, TeGO ontology (Textual Genres Ontology) was constructed in order to clarify concepts and terms that are stated in the definitions of textual genres. TeGO was developed specifically for teaching Modern Greek focusing on textual genres. In this paper, we describe the proposed models and present the target objectives and outcomes.
منابع مشابه
World Wide Web in the Service of Schooling: Semantic Web as a Solution for Language Teaching in Cypriot Secondary Education
This paper examines some suitability aspects of existing web search engines in relation to the content and the stated learning objectives of language teaching in Cypriot secondary education, focusing on the language course of the third high school grade (G9). The end goal is to put the internet in the service of schooling; specifically to categorize the results returned by the search engine int...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملAn Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)
Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملDevelopment of a Combined System Based on Data Mining and Semantic Web for the Diagnosis of Autism
Introduction: Autism is a nervous system disorder, and since there is no direct diagnosis for it, data mining can help diagnose the disease. Ontology as a backbone of the semantic web, a knowledge database with shareability and reusability, can be a confirmation of the correctness of disease diagnosis systems. This study aimed to provide a system for diagnosing autistic children with a combinat...
متن کامل